Oligonucleotide sequence formula for labeling oligonucleotide probes and proteins for in-situ analysis

ABSTRACT

The present invention provides oligonucleotide probes and oligonucleotide probe collections and protein labeling for detecting or localizing a plurality nucleic acid target genes or antigens within a cell or tissue sample. Specifically, the invention provides collections of oligonucleotide probes for use in in situ hybridization analyses in which each probe has a label-domain with the sequence formulas of (CTATTTT) n CT, (AAAATAG) n  or (TTTTATC) n  or (GATAAAA) n  in which all cases “n” would equal 1 or greater. The present invention provides collections or “cocktails” of oligonucleotide probes for detecting or localizing specific nucleic acid target genes within a cell or tissue sample. The cocktails are useful for detecting the following: the Kappa gene (SEQ ID NOS: 1-16 inclusive); the Lamba gene (SEQ ID NOS: 17 through 29 inclusive); the CMV (cytomegalovirus) gene (SEQ ID NOS: 30 through 50 inclusive); EBER (Epstein-Barr early RNA) gene (SEQ ID NOS: 51-54 inclusive); Alu (SEQ ID NOS: 55-56); PolyA (SEQ ID NO: 57); and the detection tail (SEQ ID NO: 58).

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a U.S. National Phase application of PCT/US01/28014, filed Sep. 6, 2001.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to oligonucleotide probes and collections of oligonucleotide probes for detecting or localizing nucleic acid genes targets within a cell or tissue sample. In particular, the invention relates to collections of oligoprobes.

2. Background of the Invention

In situ analysis includes in situ hybridization and immunohistochemistry. In situ hybridization (ISH) employs labeled DNA or RNA probe molecules that are anti-sense to a target gene sequence or transcript to detect or localize targeted nucleic acid target genes within a cell or tissue sample. ISH has proven to be a useful tool in a number of biomedical fields, including developmental biology, cell biology, and molecular biology. ISH has been used, for example, to diagnose genetic disorders, map genes, study gene expression, and localize sites of target gene expression.

Typically, ISH is performed by exposing a cell or tissue sample immobilized on a glass slide to a labeled nucleic acid probe which is capable of specifically hybridizing to a given target gene in the cell or tissue sample (In Situ Hybridization: Medical Applications (G. R. Coulton and J. de Belleroche, eds., Kluwer Academic Publishers, 1992); In Situ Hybridization: In Neurobiology; Advances in Methodology (J. H. Eberwine, K. L. Valentino, and J. D. Barchas, eds., Oxford University Press, 1994); In Situ Hybridization: A Practical Approach (D. G. Wilkinson, ed., Oxford University Press, 1992)). The hybridization of labeled probe molecules to nucleic acids in the cell or tissue sample can then be detected using, for example, radioactive-based direct detection methods, fluorescence-based direct detection methods, or indirect detection methods based on the binding of a fluorescence-labeled protein binding to a hapten such as BrdU, digoxigenin-labeled or biotin-labeled nucleotides incorporated into probes. Hapten-based methods have been further extended to include those molecules to be bonded by binding protein-enzyme conjugates such as antibody-enzyme-conjugates and colorimetric based detection chemistry. In addition, several target genes can be simultaneously analyzed by exposing a cell or tissue sample to a plurality of nucleic acid probes that have been labeled with a plurality of different nucleic acid tags. For example, a plurality of nucleic acid probes can be labeled with a plurality of fluorescent compounds having different emission wavelengths, thereby permitting simultaneous multicolored analysis to be performed in a single step on a single target cell or tissue sample.

A significant problem associated with incorporation of labeled nucleotides into oligonucleotide probes is that the conjugation moieties that are attached to the nucleotide usually interfere with the formation of Watson-Crick base pairing, thus negatively affecting the hybridization of the probe to its target. The has been seen with use of label attached via N4-substituted cytosine nucleotides, because of steric hinderance and the expected shift to the less reactive state of a secondary amine (as seen with N4 labeled cytosine), as compared to the natural G-C bond formed with an unsubstituted cytosine (a primary amine). Any small change or interference with G-C bonding in a small oligonucleotide (25 to 50 bases) can reduce the ability of these oligos to hybridize with the intended targeted sequence.

There remains a need in the art to develop suitable probes designs for incorporating labeled nucleotides in oligonucleotide probes. We demonstrate that a few artificial sequences are viable alternatives for probe labeling and also work both singly and in complex oligonucleotide probe mixtures for detecting or localizing nucleic acid target genes within a cell or tissue sample. The development of such generic sequences and labeling strategy for probe collections has wide application in the medical, genetic, and molecular biological arts.

This interference due to labeling chemistry and hybridization stringency and kinetics is solved herein by designing the oligo to have at least two distinct functional domains, one domain or sequence to be gene specific and involved in the base pair formation, and the second domain to be an artificial, non-specific sequence (in reference to the sample's genome) comprised of spacing nucleotides and the labeled nucleotide. These elements are positioned so that these label-nucleotides are more accessible as haptens for binding proteins (immunoglobulin or avidin(s)) and thus do not interfere with Watson-Crick base pairing in the gene-specific domain.

SUMMARY OF THE INVENTION

The present invention provides a novel strategy to incorporate label into oligonucleotide probes and labeled oligonucleotide probe collections for detecting or localizing nucleic acid target genes within a cell or tissue sample. In particular, the invention relates to non-gene-specific sequences using sequence formulas for making repetitive polymers of such sequences which can be incorporated into collections of oligonucleotide probes for use in in situ hybridization analyses. In addition, using labeled synthetic oligonucleotide polymers, based on sequence formulas, when conjugated to binding proteins, i.e. immunoglobulins, is a very effective and controlled process for labeling such proteins used in immunohistochemical analysis. The present invention provides collections or “cocktails” of oligonucleotide probes for detecting or localizing specific nucleic acid target genes within a cell or tissue sample. The cocktails are useful for detecting the following: the Kappa gene (SEQ ID NOS: 1-16 inclusive); the Lamba gene (SEQ ID NOS: 17-19); the CMV (cytomegalovirus) gene (SEQ ID NOS: 30-50 inclusive); EBER (Epstein-Barr early RNA) gene (SEQ ID NOS: 51-54 inclusive); Alu (SEQ ID NOS: 55-56); PolyA (SEQ ID NO: 57); and the detection tail (SEQ ID NO: 58).

The invention is directed to an oligonucleotide label-domain comprising the sequence (CTATTTT)_(n) and its complement (AAAATAG)_(n) wherein “n” is at least 1.

The invention is also directed to an oligonucleotide probe having at least two distinct functional domains, a first domain comprising the label-domain of claim 2, and a second domain comprising a gene-specific target sequence.

The invention is also directed to a probeset for detecting Kappa immunoglobulin light chain mRNA or corresponding heteronuclear RNA wherein the probes are selected from the group consisting essentially of SEQ ID NOS: 1 through 16, inclusive.

The invention is also directed to a probeset for detecting Lambda immunoglobulin light chain mRNA or corresponding heteronuclear RNA wherein the probes are selected from the group consisting essentially of SEQ ID NOS: 17-29.

The invention is also directed to a probeset for detecting cytomegalovirus (CMV) immediate early RNA and for corresponding mRNA wherein the probes are selected from the group consisting essentially of SEQ ID NOS: 30 through 50.

The invention is also directed to a probeset for detecting Epstein Barr virus (EBV) early RNA, RNA 1 and RNA 2, (EBER) wherein the probes are selected from the group consisting essentially of SEQ ID NOS: 51 through 54.

The invention is also directed to a probeset for detecting Human Alu repetitive satellite genomic DNA sequences wherein the probes are selected from the group consisting essentially of SEQ ID NOS: 55 and 56.

Specific preferred embodiments of the present invention will become evident from the following more detailed description of certain preferred embodiments and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a generic probe structure of the two-domain probe design. This is the oligonucleotide design used for the probes in the gene specific cocktails described in the following examples. Each probe is composed of two domains: a 5′ labeling domain and a 3′ target gene target gene-specific domain. The labeling domain consists of this specific sequence (CTATTTT)n, wherein each cytosine may be labeled with a fluorophore or a cytosine-hapten conjugate, the hapten being fluorescein in this embodiment. This illustration specifically shows nucleic acid sequences for the 301 (SEQ ID NO: 55) and 302 (SEQ ID NO: 56) probes, each of which possesses target gene gene-specific domains corresponding to human repetitive Alu sequences and labeling domains having a fluorescein hapten.

FIG. 2 illustrates the results obtained for in situ hybridization (ISH) analysis of human skin tissue using a probe comprising the labeling domain (330 probe; SEQ ID NO: 58). The absence of a detectable signal indicates that the sequence formula, (CTATTTT)_(n) CT, of the labeling domain common to the oligonucleotides used in these ISH examples is non-specific, and non-reactive in its ability to form Watson-Crick base pairing with human nucleic acid sequences because it does not hybridize.

FIG. 3 illustrates the results obtained for ISH analysis of human skin tissue using a probe comprising the labeling domain and a poly d(T) target gene-specific domain (320 probe; SEQ ID NO: 57). The presence of a detectable signal localized to the cytoplasm indicates that this probe is capable of specifically hybridizing to polyadenylated region of messenger RNA.

FIGS. 4A-4B illustrate the results obtained for ISH analysis of human skin tissue using the 320 probe, wherein the tissue sample was not treated with ribonuclease A prior to in situ hybridization (A), or was treated with ribonuclease A prior to in situ hybridization (B). The decrease in detectable signal in (B) indicates that this probe specifically hybridizes to polyadenylated region common to messenger RNA.

FIGS. 5A-5B illustrate the results obtained for ISH analysis of human spleen tissue using the 320 probe, wherein the hybridization and stringency wash were performed at room temperature (A), or at 37° C. (B). This result illustrates that the intensity of color is related to the stringency of hybridization conditions, with the more intense color indicating less stringent conditions.

FIG. 6 illustrates the results obtained for ISH analysis of the human Raji cell line using the 320 probe. This shows that this probe design also is functional with embedded cell lines as well as embedded tissue.

FIG. 7 illustrates the results obtained for ISH analysis of the human Raji cell line using a probe collection consisting of the 301 and 302 probes.

FIG. 8 illustrates the results obtained for ISH analysis of the human HT cell line using a probe collection consisting of the 301 and 302 probes.

FIG. 9 illustrates the results obtained for ISH analysis of a rat cell line using a probe collection consisting of the 301 and 302 probes. The absence of a detectable signal indicates that this probe collection is specific for human nucleic acid sequences.

FIG. 10 illustrates the results obtained for ISH analysis of an Epstein-Barr virus (EBV)-negative human HT cell line using a probe possessing a target gene-specific domain corresponding to EBV EBER nuclear RNA (SEQ ID NOS: 51 through 54 inclusive).

FIG. 11 illustrates the results obtained for ISH analysis of human spleen tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to EBV EBER 1 and 2 nuclear RNA (SEQ ID NOS: 51 through 54 inclusive).

FIG. 12 illustrates the results obtained for ISH analysis of human tonsil tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to EBV EBER 1 and 2 nuclear RNA (SEQ ID NO:51 through SEQ ID NO: 54).

FIGS. 13A-13B illustrate the results obtained for ISH analysis of human spleen tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to EBV EBER 1 and 2 nuclear RNA (SEQ ID NO:51 through SEQ ID NO:54), wherein the tissue sample was not treated with ribonuclease A prior to in situ hybridization (A), or was treated with ribonuclease A prior to in situ hybridization (B). The decrease in detectable signal in (B) indicates that this probe specifically hybridizes to human EBER 1 and EBER 2 nuclear RNA.

FIG. 14 illustrates the results obtained for ISH analysis of kappa light chain-positive human tonsil tissue using a probe possessing a target gene-specific domain corresponding to human immunoglobulin lambda light chain mRNA (SEQ ID NO: 15).

FIG. 15 illustrates the results obtained for ISH analysis of lymphoma tissues using a probe collection consisting of probes possessing target gene-specific domains corresponding to human immunoglobulin kappa light chain mRNA (SEQ ID NOS: 2-4, SEQ ID NOS:7-12, SEQ ID NOS: 14, 15). The lymphoma tissue in (A) over expresses the kappa light chain and the tissue in (B) over expresses the lambda light chain. The absence of a detectable signal in (B) indicates that the kappa light chain probe collection is specific to kappa light chain mRNA.

FIG. 16 illustrates the results obtained for ISH analysis of lambda light chain-positive human tonsil tissue using a probe possessing a target gene-specific domain corresponding to human immunoglobulin lambda light chain variable region mRNA (SEQ ID NOS:19 through 29).

FIG. 17 illustrates the results obtained for ISH analysis of a lambda light chain-positive human RPMI 8226 cell line using a probe collection consisting of probes possessing target gene-specific domains corresponding to human immunoglobulin lambda light chain mRNA (SEQ ID NOS:19 through 29).

FIGS. 18A-18B illustrate the results obtained for ISH analysis of human spleen tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to human immunoglobulin lambda light chain mRNA (SEQ ID NOS:19 through 29). The tissue in (A) over expresses the lambda light chain and the tissue in (B) over expresses the kappa light chain. The absence of a detectable signal in (B) indicates that the lambda light chain probe collection is specific to human lambda light chain mRNA.

FIG. 19 illustrates the results obtained for ISH analysis of cytomegalovirus (CMV)-positive human lung tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to CMV immediate early RNA (SEQ ID NOS:30-32, SEQ ID NOS: 34-35, SEQ ID NO: 38, SEQ ID NO: 50). (CMV infected cell).

FIG. 20 illustrates the results obtained for ISH analysis of a rat 9 G cell line in which the expression of CMV immediate early RNA has not been induced by cyclohexamide using a probe collection consisting of probes possessing target gene-specific domains corresponding to CMV immediate early mRNA (SEQ ID NOS:30-32, SEQ ID NOS: 34-35, SEQ ID NO: 38, SEQ ID NO: 50).

FIGS. 21A-21B illustrate the results obtained for ISH analysis of a rat 9 G cell line in which the expression of CMV immediate early RNA has been induced by cyclohexamide using a probe collection consisting of probes possessing target gene-specific domains corresponding to CMV immediate early RNA (SEQ ID NOS:30-32, SEQ ID NOS: 34-35, SEQ ID NO: 38, SEQ ID NO: 50). The tissue in (A) is shown at a magnification of 40× and the tissue in (B) is shown at a magnification of 20×.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention provides oligonucleotide probes and oligonucleotide probe collections for detecting or localizing nucleic acid target genes within a cell or tissue sample. In particular, the invention relates to collections of oligonucleotide probes for use in in situ hybridization analyses.

More specifically, this invention relates to the use of specific sequence formulas for nucleotide polymers or label-domains to attach a detectable moiety (a label) to oligonucleotide probes or proteins. The specific utility of these sequences or derivatives thereof, is the inert or non-reactive characteristic that does not hybridize to human DNA or RNA at a detectable level under standard stringency of hybridization conditions. These label-domains or polymers were demonstrated to be useful generic sequences for incorporation into oligonucleotide probes for detecting gene-specific sequences within cells or tissue samples in in situ hybridization analyses. Additionally, this inert set of sequences are useful for attaching a label to immunoglobulins or other proteins for detecting haptens and antigens in immunohistochemical analyses.

As used herein, the terms “probe” or “oligonucleotide probe” refers to a nucleic acid molecule used to detect a complementary nucleic acid target gene.

As used herein, the term “hybridization” refers to the process whereby complementary nucleic acid sequences join to form a double-stranded nucleic acid molecule. By labeling the target nucleic acid molecule with, for example, a radioactive or fluorescent tag, interactions between probe and target genes can be detected.

The oligonucleotide probes and oligonucleotide probes of the collections of the present invention are synthesized using conventional methods. See e.g., Methods in Molecular Biology, Vol 20: Protocols for Oligonucleotides and Analogs 165-89 (S. Agrawal, ed., 1993); Oligonucleotides and Analogues: A Practical Approach 87-108 (F. Eckstein, ed., 1991).

In a preferred embodiment of the present invention, oligonucleotide probes possess two distinct domains: a 5′ (or labeling) domain and a 3′ (or gene-specific target) domain (See FIG. 1A). In more preferred embodiments of the present invention, the oligonucleotide probe possesses a labeling domain which consists of the unique sequences (CTATTTT)_(n) and/or (CTATTTT)_(n)CT. Other embodiments are also demonstrated herein, including a triple-domain embodiment having two terminal labeling domains, and a central gene-specific target domain. Specifically, SEQ ID NOS: 125-126 depict this labeling scheme. Yet a further preferred embodiment of a labeling domain is TC(TTTTATC)_(n) or its complement. This sequence is predicted to be as unique as the (CTATTTT)_(n) CT label-domain. The oligonucleotide probes of the present invention are labeled so that hybridization between said probes and target nucleic acids in a particular cell or tissue can be detected. Labels that are acceptable for use in in situ hybridization (ISH) analysis are known to those with skill in the art. Such labels permit interactions between probe and target genes to be detected using, for example, radioactive-based direct detection methods, fluorescence-based direct detection methods, digoxigenin-labeled or biotin-labeled probes coupled with fluorescence-based detection methods, or digoxigenin-labeled or biotin-labeled probes coupled with antibody-enzyme-based detection methods. In preferred embodiments of the present invention, oligonucleotide probes are labeled with fluorescein. In mum preferred embodiments of the present invention, the oligonucleotide probe possesses a labeling domain which consists of the sequence (CTATTTT)_(n) CT, wherein the cytosine nucleotides may be labeled with a fluorophore for direct detection, or a hapten for indirect detection. In either, the fluorescein-cytosine nucleotide conjugate and the fluorescein molecule is linked at the N4 position of cytosine through an OBEA linkage (See Mishra et al., U.S. Pat. No. 5,684,142, which is incorporated herein by reference). In a preferred embodiment, the density of fluorophore attached to the label-domain is at least 7 mole percent, preferably at least 10 mole percent, and most preferably at least 16 mole percent, when measured against the label-domain solely. For example, if probe 401 is considered (a 2-domain probe) it comprises a label-domain of 30 bases including a 3′ terminal CT wherein the C is also labeled, the mole percent is 5/30=16.7 mole percent label. In the overall probe, the mole percent is 8.3.

In some embodiments of the present invention, several target genes are simultaneously analyzed by exposing a cell or tissue sample to a plurality of nucleic acid probes that have been labeled with a plurality of different nucleic acid tags. For example, a plurality of nucleic acid probes can be labeled with a plurality of fluorescent compounds having different emission wavelengths, thereby permitting simultaneous multicolored analysis to be performed in a single step on a single target cell or tissue sample.

The oligonucleotide probes and oligonucleotide probe collections of the present invention may be used in ISH analysis to detect or localize nucleic acid target genes within a cell or tissue sample. ISH may be performed as described, for example, in In Situ Hybridization: Medical Applications (G. R. Coulton and J. de Belleroche, eds., Kluwer Academic Publishers, 1992); In Situ Hybridization: In Neurobiology; Advances in Methodology (J. H. Eberwine, K. L. Valentino, and J. D. Barchas, eds., Oxford University Press, 1994); or In Situ Hybridization: A Practical Approach (D. G. Wilkinson, ed., Oxford University Press, 1992)).

The preferred embodiment of the probes and probe collections of the present invention are best understood by referring to FIGS. 1-21 and Examples 1-2. The Examples, which follow, are illustrative of specific embodiments of the invention, and various uses thereof. They are set forth for explanatory purposes only, and are not to be taken as limiting the invention.

Example 1 Probe Collection Preparation

Probe collections consisting of a plurality of oligonucleotide probes of 55 to 60 bases in length were designed as follows. In this Example, each oligonucleotide probe possessed two distinct domains: a 5′ (or labeling) domain and a 3′ (or target gene-specific) domain (See FIG. 1).

In this embodiment, the labeling domain consists of the sequence (CTATTTT)_(n) CT, wherein the cytosine nucleotide represents a fluorescein-cytosine nucleotide conjugate and the fluorescein molecule is linked at the N4 position of cytosine through an OBEA linkage.

The target gene-specific domain consists of a 25-30 base sequence that is complementary to a specific nucleic acid target gene. Oligonucleotide probes were designed to possess target gene-specific domains corresponding to the human immunoglobulin kappa light chain variable region (See Table 1; oligonucleotide probes 401-416), the human immunoglobulin lambda light chain variable region (oligonucleotide probes 501-515), human cytomegalovirus (CMV) sequences (oligonucleotide probes 221-241), human Epstein-Barr virus (EBV) EBER (Epstein-Barr early RNA) sequences (oligonucleotide probes 100A2, 100C2, 100A1, and 100B1), human repetitive Alu sequences (oligonucleotide probes 301 and 302), and poly d(T) (oligonucleotide probe 320).

Example 2 Label-Domain Design Alu Repetitive Sequence Probe

Four probes all against the Alu human repetitive sequence were used to evaluate label-domain design. The probes numbered 301 (SEQ ID NO: 55), 301A (SEQ ID NO:116), 301A2/2 (SEQ ID NO: 121), and 301A3/2 (SEQ ID NO: 122) are shown in Table 1.

The four probes were evaluated at the concentrations of 100, 75, 50, and 25 ng/ml per mL of probe in the reaction, respectively. This hybridization analysis was done manually, using standard protocols. The target, paraffin-embedded cell line MBA MD 468 (Oncor INFORM™ Her-2/neu Control Slides, Cat. No. S8100, Level 1, available from Ventana Medical Systems, Inc., Tucson, Ariz.) was the target sample and was processed by removing paraffin by standard xylene methods. The tissue was subjected to Ventana's Protease 1 for 12 minutes at 50 degrees C. as a 1:2 dilution with Ventana's APK buffer. The hybridization reaction was accomplished with the addition of probe diluent as 100 ul probe (25% formamide, 5% dextran sulfate, 2×SSC, 1% Triton) to a residual 100 ul volume of 2×SSC/Triton X-100. The slide was heated to 85 degrees C. for 5 minutes and then incubated for 1 hr at 37 degrees C. Standard SSC washes followed for removing excess probe. The hybrids were detected with an antibody against FITC. The mouse antibody was detected colormetrically using Ventana Enhanced Alkaline Phosphatase Blue Detection (cat#760-061). Unless otherwise indicated, all reagents were obtained from Ventana Medical Systems, Inc., Tucson, Ariz. The results were observed by colormetric detection using brightfield microscopy.

The results of these experiments were that signal intensity was a function of the total number of fluorescein hapten conjugated to the probe and signal was of the specific label-domain design. The greater the number of fluoresceins per probe molecule, the greater the signal observed. Comparison of design and placement of haptens on the probe showed that this was not a factor in signal intensity. The two probes that contained five fluoresceins, (301A3/2 (SEQ ID NO: 122) and 301 (SEQ ID NO:55) both yielded equivalent signal. These two probes yielded greater signal that seen for 301A2/2, a probe with a split label-domain design with four fluoresceins. The probe 301A2/2 yielded a signal greater than probe 301A a probe with a single label-domain design at the 5′ end and with three fluoresceins.

Example 3 Label Domain Design EBER Probes

This experiment compared two label-domain designs and sequences to determine whether greater spacing between the fluorescein haptens improves the production of signal during probe detection steps during in situ hybridization analysis.

The tissue used was an EBV-infected human spleen tissue fixed in neutral buffered formalin paraffin embedded section of 4-micron thickness placed on silane plus glass microscope slides. The tissue sections were deparaffinized on a Ventana DISCOVERY™ machine, followed by a 6-min digestion with Ventana's Protease 1, at a temperature of 37 C. The probe was dissolved in hybridization buffer diluent at a concentration of 50 ng/mL as a 100 ul applied to an equal volume of 2×SSC/Triton X-100 residual volume left on the slide after prepared by the Ventana Medical Systems, Inc. automated ISH staining system, Discovery. The probe diluent-mixed with the residual volume on slide for 6 min at 37 C, then the solution was heated to 85 C and held there for a total of 10 min. The slide was then taken to a 37 C temperature and held at that temperature for 1 hour. All of these aqueous reactions on the slide were all done under a film of LIQUID COVERSLIP™, to prevent evaporative loss of water during processing. Each slide after hybridization was washed 3 times with 2×SSC/Triton solution, with a 6 min incubation between each wash, the slide volume being approximately 300 ul (+/−10% vol). The hybrids were detected with an antibody against FITC. The mouse antibody was detected colormetrically using Ventana Enhanced Alkaline Phosphatase Blue Detection (cat#760-061).

The two oligonucleotide probes used for this study probe 100A1 (SEQ ID NO: 53) and 1002A32 (SEQ ID NO: 120). The two differences between these probes were the label-domain sequence and structure. The probe 100A1 label domain was 5′ to gene target domain, contained 5 fluoresceins attached to cytosine residues via the OBEA linker, with the sequence formula of (CTATTTT)₄CT (SEQ ID NO: 58). The label domain of the oligo probe 1002A32, was similar, (SEQ ID NO:125). Besides the different sequence the primary difference was that the fluorescein labeled cytosines were spaced 10 bases apart compared to the oligo 100A1 the cytosine spacing was closer at 7 bases apart. The result of this comparison as deduced by H score analysis were that these oligonucleotide were equivalent as to the amount of signal generated on the slide. The data was that for 100A2, for the 368 cells analysed in a total of 3 fields the H score was 106, and for probe 1002A32 for the 345 cell analysed in three field the H score was 109. The H score is a spectrographic analysis done with microscope that factors into the score background to signal ratio on the tissue section to yield a relative comparison of total target specific signal on the slide. (See reference Giroud, F. Perrin C, and Simony Lafontaine, J.; Quantitative Immunocytochemistry and Immunohistochemistry. Third Conference of the European Society for Analytical Cellular Pathology, 1994; and AutoCyte Quic Immuno User's Manual, 1998, document number PA-029, Co AutoCyte Inc. Burlington N.C. 2721). The histograms and the score sheet indicated that each oligo were equally efficient in yielding a colormetric signal. This indicates that the position of the label domain can be either 3 prime or 5 prime to the gene target sequence or the gene target sequence can be positioned between two label domains.

Example 4 In Situ Hybridization

The probe collections prepared in Example 1 were first diluted in a solution consisting of 20% dextran sulfate (wt/vol), 50% formamide (vol/vol), 2×SSC, 10 mM Tris-HCl, 5 mM EDTA, and 0.05% Brij-35, at a final pH of 7.3. Probe collections were then mixed with an equal volume of a solution consisting of 2×SSC and 0.05% Triton X-100.

Samples for ISH analysis were prepared by cutting formalin-fixed and paraffin-embedded cells or tissue samples into 4 μm sections and placing the sections onto a glass slide. Subsequent processing and ISH of samples was carried out in an automated device, such as the DISCOVERY™ Automated ISH/IHC Stainer (Ventana Medical Systems, Inc., Tucson, Ariz.) described in co-owned and co-pending U.S. Patent App. Ser. Nos. 60/076,198 and 09/259,240, both incorporated herein by reference. To remove paraffin from the samples, the slides were immersed in an aqueous solution, heated for approximately 20 minutes, and then rinsed. The automated deparaffinization procedure is more fully described in U.S. Ser. Nos. 60/099,018, and 09/259,240 both incorporated herein by reference. The samples were then treated with protease and the slides were heated to 85° C. (for hybridization to RNA target genes) or 90-95° C. (for hybridization to DNA target genes) for 4 to 10 minutes.

Hybridization reactions were typically performed in a hybridization buffer consisting of 10% dextran sulfate (wt/vol), 25% formamide (vol/vol), 2×SSC, 5 mM Tris, 2.5 mM EDTA, 0.025% Brij-35, 0.25% Triton X-100, and between 25 to 125 ng/mL of each individual probe molecule. ISH reactions were performed at between 37° C. to 54° C. For ISH using the probe collections described in Example 1, hybridization reactions were optimally carried out for 1 hr at 47° C. (except for the poly d(T) probe, wherein the hybridization reaction was optimally carried out at 37° C. for 1 hr).

The hybridization of fluorescein-labeled probe molecules to a particular target gene in the sample was detected by using a sequential series of binding proteins, i.e., secondary antibody detection. However, it is equally possible to use detect detection when visualizing the bound probes. In secondary detection, first, an anti-fluorescein mouse monoclonal antibody directed against the fluorescein-labeled probe molecule was added to the sample. Next, a biotin-labeled polyclonal goat antibody directed against the mouse antibody was added to the sample. Finally, hybridization reactions were colormetrically detected using a 5-bromo-4-chloro-3-indolyl phosphate/nitroblue tetrazolium (BCIP/NBT) substrate This technique, termed “secondary antibody detection,” is routine for one of skill in the art. Primary and secondary antibodies are available from numerous suppliers, including Ventana Medical Systems, Tucson, Ariz., which are optimized for use on the Ventana autostaining systems (ES®, NexES®, DISCOVERY™, and BENCHMARK™).

FIGS. 2-21 illustrate the results obtained for in situ hybridization analysis of various cell lines or tissue samples using the probes disclosed and claimed herein having the structural motif illustrated in FIG. 1 or probe collections consisting of such probes.

FIG. 1 illustrates a generic probe structure of the two-domain probe design. This is the oligonucleotide design used for the probes in the gene specific cocktails described in the following examples. Each probe is composed of two domains: a 5′ labeling domain and a 3′ target gene target gene-specific domain. The labeling domain consists of this specific sequence (CTATTTT)_(n) CT, wherein the cytosine nucleotide is a cytosine-hapten conjugate, the hapten being fluorescein in this embodiment. This illustration specifically shows nucleic acid sequences for the 301 (SEQ ID NO: 55) and 302 (SEQ ID NO: 56) probes, each of which possesses target gene gene-specific domains corresponding to human repetitive Alu sequences and labeling domains having a fluorescein hapten.

FIG. 2 illustrates the results obtained for in situ hybridization (ISH) analysis of human skin tissue using a probe comprising the labeling domain (330 probe; SEQ ID NO: 58). The absence of a detectable signal indicates that the sequence formula, (CTATTTT)_(n) CT, of the labeling domain common to the oligonucleotides used in these ISH examples is non-specific, and non-reactive in its ability to form Watson-Crick base pairing with human nucleic acid sequences because it does not hybridize.

FIG. 3 illustrates the results obtained for ISH analysis of human skin tissue using a probe comprising the labeling domain and a poly d(T) target gene-specific domain (320 probe; SEQ ID NO: 57). The presence of a detectable signal localized to the cytoplasm indicates that this probe is capable of specifically hybridizing to polyadenylated region of messenger RNA.

FIGS. 4A-4B illustrate the results obtained for ISH analysis of human skin tissue using the 320 probe, wherein the tissue sample was not treated with ribonuclease A prior to in situ hybridization (A), or was treated with ribonuclease A prior to in situ hybridization (B). The decrease in detectable signal in (B) indicates that this probe specifically hybridizes to polyadenylated region common to messenger RNA.

FIGS. 5A-5B illustrate the results obtained for ISH analysis of human spleen tissue using the 320 probe, wherein the hybridization and stringency wash were performed at room temperature (A), or at 37° C. (B). This result illustrates that the intensity of color is related to the stringency of hybridization conditions, with the more intense color indicating less stringent conditions.

FIG. 6 illustrates the results obtained for ISH analysis of the human Raji cell line using the 320 probe. This shows that this probe design also is functional with embedded cell lines as well as embedded tissue.

FIG. 7 illustrates the results obtained for ISH analysis of the human Raji cell line using a probe collection consisting of the 301 and 302 probes.

FIG. 8 illustrates the results obtained for ISH analysis of the human HT cell line using a probe collection consisting of the 301 and 302 probes.

FIG. 9 illustrates the results obtained for ISH analysis of a rat cell line using a probe collection consisting of the 301 and 302 probes. The absence of a detectable signal indicates that this probe collection is specific for human nucleic acid sequences.

FIG. 10 illustrates the results obtained for ISH analysis of an Epstein-Barr virus (EBV)-negative human HT cell line using a probe possessing a target gene-specific domain corresponding to EBV EBER nuclear RNA (SEQ ID NO: 51 through SEQ ID NO: 54).

FIG. 11 illustrates the results obtained for ISH analysis of human spleen tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to EBV EBER 1 and 2 nuclear RNA (SEQ ID NO:51 through SEQ ID NO:54).

FIG. 12 illustrates the results obtained for ISH analysis of human tonsil tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to EBV EBER 1 and 2 nuclear RNA (SEQ ID NO:51 through SEQ ID NO:54).

FIGS. 13A-13B illustrate the results obtained for ISH analysis of human spleen tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to EBV EBER 1 and 2 nuclear RNA (SEQ ID NO:51 through SEQ ID NO:54), wherein the tissue sample was not treated with ribonuclease A prior to in situ hybridization (A), or was treated with ribonuclease A prior to in situ hybridization (B). The decrease in detectable signal in (B) indicates that this probe specifically hybridizes to human EBER 1 and EBER 2 nuclear RNA.

FIG. 14 illustrates the results obtained for ISH analysis of kappa light chain-positive human tonsil tissue using a probe possessing a target gene-specific domain corresponding to human immunoglobulin lambda light chain mRNA (SEQ ID NO:15).

FIG. 15 illustrates the results obtained for ISH analysis of lymphoma tissues using a probe collection consisting of probes possessing target gene-specific domains corresponding to human immunoglobulin kappa light chain mRNA (SEQ ID NOS: 2-4, SEQ ID NOS:7-12, SEQ ID NOS: 14, 15). The lymphoma tissue in (A) over expresses the kappa light chain and the tissue in (B) over expresses the lambda light chain. The absence of a detectable signal in (B) indicates that the kappa light chain probe collection is specific to kappa light chain mRNA.

FIG. 16 illustrates the results obtained for ISH analysis of lambda light chain-positive human tonsil tissue using a probe possessing a target gene-specific domain corresponding to human immunoglobulin lambda light chain variable region mRNA (SEQ ID NOS:19 through 29).

FIG. 17 illustrates the results obtained for ISH analysis of a lambda light chain-positive human RPMI 8226 cell line using a probe collection consisting of probes possessing target gene-specific domains corresponding to human immunoglobulin lambda light chain mRNA (SEQ ID NOS:19 through 29).

FIGS. 18A-18B illustrate the results obtained for ISH analysis of human spleen tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to human immunoglobulin lambda light chain mRNA (SEQ ID NOS:19 through 29). The tissue in (A) over expresses the lambda light chain and the tissue in (B) over expresses the kappa light chain. The absence of a detectable signal in (B) indicates that the lambda light chain probe collection is specific to human lambda light chain mRNA.

FIG. 19 illustrates the results obtained for ISH analysis of cytomegalovirus (CMV)-positive human lung tissue using a probe collection consisting of probes possessing target gene-specific domains corresponding to CMV immediate early RNA (SEQ ID NOS:30-32, SEQ ID NOS: 34-35, SEQ ID NO: 38, SEQ ID NO: 50). Arrow indicates CMV infected cell.

FIG. 20 illustrates the results obtained for ISH analysis of a rat 9 G cell line in which the expression of CMV immediate early RNA has not been induced by cyclohexamide using a probe collection consisting of probes possessing target gene-specific domains corresponding to CMV immediate early RNA (SEQ ID NOS:30-32, SEQ ID NOS: 34-35, SEQ ID NO: 38, SEQ ID NO: 50).

FIGS. 21A-21B illustrate the results obtained for ISH analysis of a rat 9 G cell line in which the expression of CMV immediate early RNA has been induced by cyclohexamide using a probe collection consisting of probes possessing target gene-specific domains corresponding to CMV immediate early RNA (SEQ ID NOS:30-32, SEQ ID NOS: 34-35, SEQ ID NOS: 38 through 50) expression of the CMV immediate early RNA with cyclohexamide. The tissue in (A) is shown at a magnification of 40× and the tissue in (B) is shown at a magnification of 20×.

TABLE 1 Probe SEQ ID Sequence ID 401 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CCAGAGTAGCAGGAGCCCCAGGAGCTGAGC-3′ 1 402 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGATGGAGACTGGGTCAACTGGATGTCACA-3′ 2 403 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GCAAGCGATGGTGACTCTGTCTCCTACAGC-3′ 3 404 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TCTGTCCCAGATCCACTGCCACTGAACCTT-3′ 4 405 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GCAGCCACAGTTCGCTTCATCTGCACCTTG-3′ 5 406 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TTTCAACTGCTCATCAGATGGCGGGAAGAT-3′ 6 407 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  AAGTTATTCAGCAGGCACACAACAGAGGCA-3′ 7 408 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGCGTTATCCACCTTCCACTGTACTTTGGC-3′ 8 409 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TAGGTGCTGTCCTTGCTGTCCTGCTCTGTG-3′ 9 410 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GTAGTCTGCTTTGCTCAGCGTCAGGGTGCT-3′ 10 411 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GATGGGTGACTTCGCAGGCGTAGACTTTGT-3′ 11 412 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CTCTCCCCTGTTGAAGCTCTTTGTGACGGG-3′ 12 413 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TGGAACTGAGGAGCAGGTGGGGGCACTTCT-3′ 13 414 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GAAAAAGGGTCAGAGGCCAAAGGATGGGAG-3′ 14 415 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  AGATGAGCTGGAGGACCGCAATAGGGGTAG-3′ 15 416 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GCATAATTAAAGCCAAGGAGGAGGAGGGGG-3′ 16 501 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CCTGAGTGAGGAGGGTGAGGAGCAGCAGAG-3′ 17 502 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  AGACCCAGACACGGAGGCAGGCTGAGTCAG-3′ 18 503 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TGTTGGTTCCAGTGCAGGAGATGGTGATCG-3′ 19 504 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TAAATCATGATTTTGGGGGCTTTGCCTGGG-3′ 20 505 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TGTTGCCAGACTTGGAGCCAGAGAAGCGAT-3′ 21 506 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  AATAATCAGCCTCGTCCTCAGCCTGGAGCC-3′ 22 507 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGTCCCTCCGCCGAAAACCACAGTGTAACT-3′ 23 508 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TTATGAGACACACCAGTGTGGCCTTGTTGG-3′ 24 509 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CTGCTCAGGCGTCAGGCTCAGATAGCTGCT-3′ 25 511 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  ATGCGTGACCTGGCAGCTGTAGCTTCTGTG-3′ 26 512 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  ATTCTGTAGGGGCCACTGTCTTCTCCACGG-3′ 27 513 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CCTCCCCTGGGATCCTGCAGCTCTAGTCTC-3′ 28 515 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TGAGGGTTTATTGAGTGCAGGGAGAAGGGC-3′ 29 221 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGAGGTCAAAACAGCGTGGATGGCG-3′ 30 222 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GAGGCTGGATCGGTCCCGGTGTCTT-3′ 31 223 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  AATCCGCGTTCCAATGCACCGTTCC-3′ 32 224 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TAAAAACTGCGGGCACTGGGGACGG-3′ 33 225 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  ACCCGAGATTCGCGTGGAGATCCCA-3′ 34 226 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GAGCAAGGAGCTGCCGAGCGACCAT-3′ 35 227 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  ACACTGGTGGTGGTGGGCATCGTGC-3′ 36 228 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TTCCAAATGCGTCAGCGGTGCAAGC-3′ 37 229 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  AGCTGCCTGCATCTTCTTCTGCCGC-3′ 38 238 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TCTCAGAGGATCGGCCCCCAGAATG-3′ 47 239 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CCTCATCTGACTCCTCGGCGATGGC-3′ 48 240 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CGGGTACAGGGGACTCTGGGGGTGA-3′ 49 241 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGGTGGGTGCTCTTGCCTCCAGAGG-3′ 50 100A2 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GACCTCGGGTCGGTAGCACCGCACT-3′ 51 100C2 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGAAGCCTCTCTTCTCCTCCCCCGG-3′ 52 100A1 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CCACAGACACCGTCCTCACCACCCG-3′ 53 100B1 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  GGCTACAGCCACACACGTCTCCTCC-3′ 54 301 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CGAGGCGGGCGGATCACCTGAGGTC-3′ 55 302 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  CGGGAGGCGGAGGTTGCAGTGAGCC-3′ 56 320 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT  TTTTTTTTTTTTTTTTTTTTTTTTTTTTTT-3′ 57 301A 5′-CTATTTTTCTATTTTTCTTTT  CGAGGCGGGCGGATCACCTGAGGTC-3′ 116 302C 5′-CTATTTTTCTATTTTTCTTTT  CGGGAGGCGGAGGTTGCAGTGAGCC-3′ 117 302A4 5′-CTATTTTATACTTTATATTTCATATTTTATCT  CGGGAGGCGGAGGTTGCAGTGAGCC-3′ 118 302A3/2 5′-CTATTTTATATTTATATTTCT  CGGGAGGCGGAGGTTGCAGTGAGCC  ACTATTTTATACTT-3 119 1002A32 5′-CTATTTTATACTTTATATTTCT GACCTCGGGTCGGTAGCACCGCAC TACTATTTTATACTT-3′ 120 301A2/2 5′-CTATTTTTCTT CGAGGCGGGCGGATCACCTGAGGTC TTCTTTTTATCTT-3 121 301A3/2 5′-CTATTTTATACTTTATATTTCT CGAGGCGGGCGGATCACCTGAGGTC ACTATTTTATACTT-3′ 122

TABLE 2 Probe SEQ ID Sequence ID 5′-CTATTTTTCTATTTTTCTTTT 123 5′-CTATTTTATACTTTATATTTCATATTTTATCT 124 330 5′-CTATTTTCTATTTTCTATTTTCTATTTTCT 58 5′-CTATTTTATACTTTATATTTCT...........ACTATTTTATACTT-3 125 5′-CTATTTTTCTT...........TTCTTTTTATCTT-3 126

It should be understood that the foregoing disclosure emphasizes certain specific embodiments of the invention and that all modifications or alternatives equivalent thereto are within the spirit and scope of the invention as set forth in the appended claims. 

1. A synthetic oligonucleotide probe having a 5′ end and a 3′ end and comprising: at least two separate domains, the 5′ end consisting of a first domain consisting of the sequence 5′-(CTATTTT)_(n)CT-3′ wherein “n” is at least 4, wherein the first domain does not hybridize with human nucleic acids under standard stringency conditions and, wherein the first domain is detectably labeled through at least one cytosine in the sequence, and a second domain comprising a target sequence capable of hybridizing to a DNA or RNA sequence, wherein the second domain is not detectably labeled, the beginning of the target sequence starting at the 3′ end of the first domain and the end of the target sequence being the 3′ end of the probe, and the at least two separate domains comprising one uninterrupted polynucleotide sequence.
 2. The synthetic oligonucleotide probe of claim 1 wherein said first domain is detectably labeled with a reporter molecule, or a hapten molecule.
 3. The synthetic oligonucleotide probe of claim 2 wherein the hapten is fluorescein linked to the N4 nitrogen of cytosine through an N⁴-[2,2-oxy-bis-(ethylamine)]-2′-deoxycytidine-5′-nucleotide linker.
 4. The synthetic oligonucleotide probe of claim 2 wherein the reporter molecule is a fluorophore.
 5. The synthetic oligonucleotide probe of claim 4 wherein the fluorophore is present in the first domain at a density of greater than 7 mole percent.
 6. The synthetic oligonucleotide probe of claim 1 wherein at least 7 mole percent of the cytosine(s) in the first domain are linked to a detectable moiety by an N⁴-[2,2-oxy-bis-(ethyl amine)]-2′-deoxycytidine-5′-nucleotide linker.
 7. A synthetic oligonucleotide probe having a 5′ end and a 3′ end and comprising: at least two separate domains, the 3′ end consisting of a first domain consisting of the sequence 5′-(CTATTTT)_(n)CT-3′ wherein “n” is at least 4, wherein the first domain does not hybridize with human nucleic acids under standard stringency conditions and, wherein the first domain is detectably labeled through at least one cytosine in the sequence, and a second domain comprising a target sequence capable of hybridizing to a DNA or RNA sequence, wherein the second domain is not detectably labeled, the beginning of the target sequence starting at the 5′ end of the first domain and the end of the target sequence being the 5′ end of the probe, and the at least two separate domains comprising one uninterrupted polynucleotide sequence.
 8. The synthetic oligonucleotide probe of claim 7 wherein the first domain is detectably labeled with a reporter molecule, or a hapten molecule.
 9. The synthetic oligonucleotide probe of claim 8 wherein the hapten is fluorescein linked to the N4 nitrogen of cytosine through an N⁴-[2,2-oxy-bis-(ethylamine)]-2′-deoxycytidine-5′-nucleotide linker.
 10. The synthetic oligonucleotide probe of claim 8 wherein the reporter molecule is a fluorophore.
 11. The synthetic oligonucleotide probe of claim 10 wherein the fluorophore is present in the first domain at a density of greater than 7 mole percent.
 12. The synthetic oligonucleotide probe of claim 7 wherein at least 7 mole percent of the cytosine(s) of the first domain are linked to a detectable moiety by an N⁴-[2,2-oxy-bis-(ethylamine)]-2′-deoxycytidine-5′-nucleotide linker.
 13. A plurality of synthetic oligonucleotide probes for detecting Kappa immunoglobulin light chain mRNA or corresponding heteronuclear RNA wherein the probes are selected from the group consisting of SEQ ID NOS: 1 through 16 inclusive. 